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ABSTRACT 

A technique for classifying higher education 
institutions was developed in order to identify homogenous subsets of 
institutions and to compare an institution with its empirically 
determined peers. The majority of the data were obtained from a 
4-year longitudinal file that merged the finance, faculty, 
enrollment, and institutional characteristics survey .of the Higher 
Education General Information Survey (1975-1978). The methodology 
included factor analysis of 189 variables and cluster analysis of 
institutions using information on the number, type, and levels of 
degrees awarded. The major doctoral institutions/research clustered 
into three major groups and two singlets. The 82 major doctoral 
institutions/non-research clustered into seven groups, four of which 
were singlets; the 287 comprehensive universities clustered into 10 
groups, with 5 being sMiglets. Finally, the 686 geneiral baccalaureate 
institutions clustered into 12 groups, with 4 being singlets, and 2 
groups having 3 or fewer institutions. Appended materials include 
descriptions of the following eight dimensions that were used to 
cluster institutions: tuition dependence, instructional emphasis, 
black student enrollment, facilities, research emphasis, growth, 
size, and endowment level. (SVf) 
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Introduction . 

Traditional classification systems of higher education 
institutions, such as the Carnegie classifications or the newer 
NCHEMS classifications, separate, institutions into very broad, 
summary categories that in many aspects have as much diversity within 
classes as between. As a result, little is gained in making 
comparisons among such broadly defined sectors or in using these 
classifications in micro-level analyses of individuals. On the other 
hand it is not feasible to compare institutions on a one-to-one 
basis, nor' would such an analysis be particularly i?€levant to the 
higher education community. Additionally, in looking at individual 
data in relation to higher education experiences or outcomes, 
previous work in this area suggests that there is sufficient- noise in 
singular institutional char acteri stipes to obscure any significant 
findings. ' • 

The purpose of the study was to identify higher-order dimensions 
of institutional characteristics that would permit homogeneous 
subsets of institutions to be identified. These subsets could then 
be used to compare an institution with its empirically determined 
peers It would also provide school determinants with sufficient 
reliability such that it might be possible to detect interactions 
between higher education institutional characteristics^ and student 
characteristics. ^ . 

The Data 

Data for the study came from a four-year longitudinal file that 
^was developed for NCES. This file merged the finance, faculty, 
enrollment, and institutional characteristics survey of HEGIS for 
1975 through 1978. Additionally, several data elements from the ACE 
Title III files and BEOGS program data files were included. A 
summary of the variables and their source is contained in Table 1. 
All finance variables were adjusted for size using full-time 
equivalent enrollment. 

The Methodology 

Since it was felt that many of the HEGIS variables would be 
highly correlated, 189 variables representing each of the domains 
listed in Table 1 were factor analyzed using an alpha factor analytic 
technique. The resulting independent dimensions were rotated .to 



simple structure using the varimax criteria, and each institution was 
scored on each of the resulting factors, for use in subsequent 
analyses. 

In defining homogeneous subsets of institutions, the analysis 
was 'limited to four-year schools. In addition, because the* number 
and types of degrees awarded, and the programmatic emphasis of the 
institution were institutional dimensions that had not been included 
in the factor analysis, the . clustering , of institutions was carried 
out within categories representing the number, type, and levels of 
degrees awarded. 

In particular, separate cluster analyses were done for major 
doctoral/research institutions, major doctoral/non-research 
institutions, comprehensive institutions and general baccalaureate 
institutions. Clusters of institutions were developed using the 
Fastclus subroutine of the Statistical Analysis System. This 
subroutine forms a cluster by examining the linear distance of' every 
element to be clustered (in this case institutions) from a pre- 
defined cluster center, and assigning an element to a particular 
cluster on the basis of that distance. Since the number of clusters 
is defined a priori, the computational algorithm functions within the 
pre-defined number of clusters, rather than selecting an optimal 
number of clusters. Obtaining the optimal number of clusters is an 
empirical process, although by looking at the maximum distance within 
a cluster relative to the distance between cluster centers, it is 
possible to achieve a good fit to the data. For. each of the four 
categories of institutions, the same factors were used to cluster 
institutions. 

As a measure of the goodness of fit of the cluster solution to 
the data, discriminant analyses were performed. Since one objective 
of discriminant analysis is to determine the probability of group 
membership, a posteriori , it was felt that if all schools within a 
given cluster had a high probability of being in that cluster and a 
low probability of being in all other clusters, then the number of 
clusters could be considered sufficient, although not necessarily 
optimal. 



The Results 

The factor analysis resulted in 27 factors with eigenvalues 
greater than one. These accounted for 90% of the variability in the 
correlation matrix. Since many of these factors either accounted for 
a very small proportion of variance or were not relevant to the 
4-year institution being considered, eight factors, accounting for 
42% of the variance, were used in the cluster analysis. These eight 
factors" are described in Table 2. 

In considering these factors, it is interesting to note that 2 
factors - tuition dependence and endowment - represent sources of 
Institutional support, 2 represent institutional mission 



instruction and research, and 2 represent different dimensions of 
size - size of physical plant and number of students. . 

The factors were used in separate cluster analyses for the 4 
institutional types and the results of these analy§efi will be 
discussed separately. 



Major Doctoral Institutions/Research 

These institutions clustered into three major clusters and two 
singletg. Singlets generally represent either very distinct 
institutions or extreme anoma^lies in the data. Table 3 presents the 
mean , scores' of the ' five clusters on each of the eight dimensions. 
For schools of this type, tuition dependence^ instructional emphasis, 
research'' emphasis, and endowment levels are important dimensions. 
Although clusters were defined on the basis of eight independent 
dimensions, it is possible to get a graphic perspective of the 
results of the cluster analysis by looking at plots in 2-dimensional 
space- Figure 1 illustrates the five clusters of "major doctoral 
institutions that are research-oriented for the tuition dependence 
and instructional emphasis dimensions. As may be seen, even on just 
two dimensions there is excellent separation between clusters, and 
integrity withing clusters. - ' - 

Table 4 presents the results of the discriminant analysis 
procedure - indicating that esach of the major clusters were 
sufficient, whereas singlets were placed in the cluster that they 
were closest to. 

Major Doctoral Institutions/Non-research 

Clustering this set of 82 institutions resulted in seven 
clusters, 4 of which were singlets. Table 5 contains the mean score 
of each cluster on each of the eight dimensions. For the three major 
clusters, tuition dependence, instructional emphasis, and endowment 
level seem to be the critical dimensions of difference, although the 
facilities dimension is also important in separating clusters A and 
C. A two-dimensional pictorial representation of the separation of 
these clusters ^is presented in Figure 2 - a plot of the endowment 
level" and tuition dependence dimensions. The regults of the 
discriminant analysis for these seven clusters are available in Table 
6. 



Comprehensive Universities 

The 287 institutions in this class clustered into 10 groups, 
with 5 being singlets. This means of the clusters on the eight 
dimensions, shown in Table 7, indicate that tuition dependence 
separated all five primary clusters, while instructional emphasis 
distinguished clusters C and I from H. black student enrollment .s, 
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distinguished G from the other 4 major clusters, facilities separated 
cluster H from cluster C, G, and I, and endowment level separated J 
and>H from clusters C, G and I. The two-dimensional plot of 
endowment level by instructional emphasis shows some ambiguity among 
the defined major clusters, and suggests the importance of 
multidimensional criteria for clustering institutions. This 
ambiguity in two dimensions notwithstanding, the results of the 
discriminant analyses shown in Table 8 indicate the sufficiency of 5 
major clusters for fitting these institutions. 

General Baccalaureate Institutions 

The 686 institutions in this class clustered into 12 groups, 
with 4 being singlets, and 2 groups having 3 or fewer institutions. 
The means of the 12 clusters on the eight dimensions are shown in 
Table 9 Among the six major clusters, tuition dependence, black 
student enrollment, and endowment seem to be the critical dimensions 
in separating general batfcalureate institutions. Two 2-dimen^ional 
plots actually illustrate how the multidimensional criteria for 
clustering operates. Figure 4 plots black student enrollment and 
tuition dependence. It can be seen in this plot how black students 
enrollment differentiates between schools in cluster D and all other 
clusters, while tuition dependence separates cluster E,. L, and H. 
These two dimensions do not function well in separating clusters H 
and K, however. Figure 5 plots endowment level by tuition 
dependence. Here it may be seen that endowment separates cluster H' 
and K but cluster D is indistinct from cluster E. The result of the 
discriminant analysis are presented in Table 10. These discriminant 
analysis results are not quite as confirming as those achieved with 
the other three classes of institutions, suggesting either that more 
clusters might be appropriate, or that the dimensions on which these 
clusters were based were not as , relevant to • this class of 
institutions as they were for the three other institutional types. 

Summary 

The results of this study clearly demonstrate the viability- of 
developing homogeneous subsets of institutions that can be used^for 
comparing an institution with its empirically determined peers. Yet, 
the results also have methodological implications that should be 
considered in making practical use of this technique. 

The results for comprehensive and general baccalaureate 
institutions indicated a need for using multiple dimensions that are 
also relevant to particular types of institutions if -homogeneous 
clusters of institutions are to be developed and if definitive 
separation among clusters is to be achieved. 

The results also suggest that the composition of a particular 
cluster is somewhat dependent on the dimensions used for clustering - 
that is, if different dimensions were used, institutions would 
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cluster in slightly different ways. Thus'-, in using this technique, 
consideration - should be given to the comparisons to be made among 
peer institutipns so that the dimensions used for clustering are 
consonant with the desired comparisons. 
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Table 1. Domains of variables used in the analysis and source. 



Variable Domain 



Current funds revenues by source 

Current Funds Expenditures and 
Mandatory Transfers 

Physical Plant Assets 

Indebtedness on Physical Plant 

Endowment Assets 

Changes in Fund Balances - 
Net Increases /Decreases 

Number of Faculty 

Ethnicity of Students 

Number of Students 

Tuition 

Admission Requirements 

Number of BEOG's Awards, 
Amount of Awards, and 
Title III Institution 



Source 

REGIS Finance Survey - Part A, 1978 
REGIS Finance Survey - Part B, 1978 

REGIS Finance Survey - Part C, 1978 
REGIS Finance Survey - Part D, 1978 
REGIS Finance Survey - Part E, 1978 
REGIS Finance Survey - Part F, 1978 

REGIS Employee Survey, 1978 

REGIS Enrollment Survey, 1978 

REGIS Enrollment Survey, 1978 

REGIS Institutional Characteristics 
Survey, 1978 " ' . 

REGIS Institutional Characteristics 
Survey, 1978 

ACE Title III File 
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Table 2. Description of Eight Dimensions Used to Cluster Institutions 



.Factor 



Name 



Description 



Tuition Dependence 



II 



Instructional 
Emphasis 



III 



Black Student 
Enrollment 



IV 



Facilities ' 



Research Emphasis 



Schools having a high score on this factor 
both\ spend and earn money on auxiliary 
enterprises such as bookstores, athletics,, 
dormitories, and food services. A high 
proportion of the revenue of these schools 
comes from student tuition and tuition 
charges for both undergraduate and graduate 
students are high. Additionally, schools 
with a high score on this factor tend to 
provide scholarships or grants for students, 
'>have a small number of part-time students, 
and a high number of faculty for the number . 
of students In the school. 

Schools with a high score on this factor 
spend a disproportionate amount on instruc- 
tion and services that support instruction - 
SMch as audio/visual services, computing 
support, and course and curriculum development, 
as well as libraries, general administrative 
services, and research. These schools tend 
to receive revenues from State, Federal and 
private sources which are earmarked either 
for specific research projects of instrucr 
tlonal or public service programs. 

Schools with a high score on this factor tend 
to have a large number of students who rieceive 
substantial Basic Educational Opport;,unity Grant 
assistance. In addition, they have i high 
proportion of black undergraduate and graduate 
students and tend to award institutional grants 
to students. 

Schools with a high score on this factor either 
have a very large campus or exist in an area 
where building and land costs are high. In 
order to maintain these facilities, shcools 
with a high score on this factor spend a dis- 
proportibnate amoimt direlrtl^ 

nance and operation of their buildings and land, 
and in ancillary services such as general 
administration and community relations. 

Schools with a high score on this factor spend 
and earn revenues from major Federally funded 
research and development centers that are inde- 
pendent of the, institution's primary mission. 
They also tend to emphasize research within the 
institution and probably add equipment in support 
of this research function. Ifa addition to Federal 
sources of ^revenues, these schools receive money 
for specific projects through private gifts and 
endowment income. 
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Table 2. Continued « 



\ 



Factor 



VI 



Name 



Growth 



VII 



Size 



VIII 



Endowment Level 



f. Description 

Schools with a high score on this factor have 
had a large increase in funds available for 
the renewal and replacement of their physical 
plant and have apparently used these funds to 
add buildings to their physical plant. These 
schools have also acquired additional equip- 
ment, probably in support of instruction and 
public service functions. 

Schools with a high score on this factor are 
large and have an even larger proportion of 
graduate students. Th^y tend to receive money ' 
from State appropriations and pay higher than 
average faculty salaries. 

Schools having a high score on this factor 
would tend to have very large endowments with 
attendant high earnings, and realized income. 
Additionally, they would be building their 
endowments at a fairly high rate, and maybe 
channeling endowment income into the support.- 
of libraries, scholaVsjhips and services that 
support the institution's primary missions of 
instruction, research, and/or public services. 



ERIC 



Table 3* Results of tUe cluster Analyals for major 
Doctoral instltutlons/rcsear^lu 



CLUSTER 

A 

B 
C 
D 
E 



CLUSTER SUnHARY 

heuiers hax distance froh center 



49 

f 
I 
1 



24. 18821 
12.95492 
18.SS724 
• 



CLUSTER Tuition Depehd Instruc EioDh, Blk Stud Enr 



A 
B 

C 
D 
E 



1.75444814 

14.59455145 
28.92550527 
21.78947071 
59.94152995 



5. 
14. 



.78159484 

.51555851 
28.49288845 
57.25514575 
54.0244924S 



-2 
-5 
-4 
-4 
-10 



12572811 
,94978442 
,24404450. 
,44554427 
. 11148824 



CLUSTER CENTERS 
Facilities Researcjf Emph 



0.97079145 

7.59874799 
15.52744908 
17.44724975 
54. S97 17257 



5.12519849 
12.59994448 
29.57582547 
28.92942721 
141.94147504 



Growth 

5.92795525 
4.41585420 
11.44745555 
28.49545400 
28.79441989 



Size 

8. 19425014 
4.84414285 

8.90552251 
7.40082215 
10.58120897 



EndowM 

1.0S|«4^ 
14.5558411 
40.494002* 
20. 150907^ 
84.839128^ 



Table 4. Number and Percent of Major Doctoral/Research Institutions 
Classified into Five Clusters by Discriminant Analysis. 

Classified into Cluster 
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Total 
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Number 


Percent 


Kumber 


Percent 


Number 


Percent 


Number 


Percent 


A 


48 


98.96 


1 


2.04 


0 


0.00 


49 


100.00 


B 


0 


0.00 


10 


100.00 


0 


0.00 


10 


- 100.00 


C 


0 


0.00 


0 


0.00 


9 


100.00 
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100.00 
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0.00 


1 


100.00 




0.00 


' 1 


100.00 
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0.00 
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0.00 
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100.00 
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100.00 
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Figure 1. '^Plot of, ln«tructtonal cmphaslfi bv tuition dependence for 
Major ""Doctornl In)Jtltutlon«/Ucscarch 
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iabiti 3* Kcstiits oi tiiL* cluster analysis lor llajor 
Doctor«il Inntltutiont/tlon-rcsQaVch. 



CLUSTER 

A 
D 
C 
D 
E 
F 
G 



CLUSTER SUnriAKY 
HEHIERS HAX 0I5TANC6 FROH CFHTER 



5 
1 

5S 

I 

15 
1 
1 



7.274152 
0 

13.47907 
0 

7.I30U9 
0 
• 



Tuition Depend Instruc Emph Blk Stud Enr 



'14.0^7780352 
22.8^2809882 
-0.396554^74 
-3.42459K03 

9.017500907 
27.4^99290^8 

5.931223478 



10.707351700 
11.487342413 
1.042404834 
1.908547975 
4.910537191 
17.754859872 
11.789270180 



-3.773245507 
-4.927870033 
-1.427575221 
-2.416775952 
-3.442787044 
-5.505009107 
10. 149844994 



CLUSTER CENTERS 
Facilities Research Emph 



4.53.1741299 

9.410040408 
-0.305585233 
-0.432911748 

2.532431859 
10.757514594 

5.929405414 



7.821139942 
14.170494317 
0.444429247 
0.477478419 
3.478251949 
13.432979435 
4.231070414 



Growth 

3.035495294 
5.022840448 
1.331317844 
2.803244481 
1.527802249 
5.044074083 
10.328449010 



Size 

2.572774700 
2.444147272 
3.988229198 
30.711117480 
2.471914484 
3.870321385 
5.844314345 



Fndnu 



14.47959988'. 
47.28344720: 
-0.41958375* 
-1.93072920; 

7.03940108' 
31.381829971 

3.5413814M 



Table 6. Number and Percent of Major Doctoral/Npn-research Institutions 
Classified into Seven Clusters by Discrlninant Analysis. 



Classified from 
Cluster 



Classified Into Cluster 
C E 



Total 





Number 


Percent 


Number 


Percent 


Number 


Percent 


Number 


Percent 


A 


5 


100.00 


0 


0.00 


0 


0.00 


5 


100. OG 


B 


I 


100.00 


0 


0.00 


0 


0.00 


1 


100.00 


C 


0 


0.00 


58 


100.00 


0 


0.00 


58 


100.00 


D 


0 


0.00 


1 


100.00 


0 


0.00 


1 


100.00 


E 


0 


0.00 


0 


0.00 


15 


100.00 


15 


100.00 


F 


1 


100.00 


0 


0.00 


0 ■ 


"i'- - 0.00 


1 


100.00 


G 


0 


0.00 


1 


100.00 


0 


0.00 


1 


100.00 
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Fl^.ure 2. riot of EiidotTncnt Level by Tuition Dependence for 
Major Doctoral Institutions/Non-research. 
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2.}«II0(^2 
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-I.ff(7»l3( 
*S.7U(27S« 
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•S.III7I 1^5 
•I.IMI72«« 
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«.f«lt72(7 
II. 2fS 17429 



(.99)292(2 
).7(90I»I9 
|.9^19«»2S 
I.I(((12I9 
I. 31992)99 
I.I4722I9I 
2.^1197111 
1.(1379111 
l.3^9S39l( 
4.II9SSIS4 



-1.25577991 
-3. 15519575 
-I.37I9?59I 
-I.379ll<il7 
•II. :^1(0«I( 
-2.:7I5II5I 
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•3.II^(22U 



3.II(3I2H 
I. 27131199 
>I.2^(I793( 
-2.734(1525 
59.43395237 
-2.5I7I5IU 
-I. 21523231 
I.I2I3375I 
1.57213991 
3.4II7IS94 



9. |75((497 
l(. 1915(791 
-•.(I217(3( 
-1.393(1954 
49.324 34914 
I. II2III2I 
I. 42325191 
1. 44511114 
I.S7521I53 
S.S24I«4II 



Growth 

f .97341195 
11.42511111 
-I. 11257377 
3.I5I3I47I 
S3.II9I7I4I 
1.(1511(75 
1.9751(121 
2.4IS4S272 
•I. 1(122(13 
1. 1(995752 



Sisi 

2.94377( 
5.3I54IK 
I.Kdd 

11.215992 
9.53(153 

SI. 595(27 
1.91(147 
•.32121(, 
1.4727751 
I.l5(f 174 



Entfowscnt 

4. III7III5 
II.20I12III 
•1.157(1991 
-2.5(497451 
57.12797927 
-2.(t349(47 
*l. 42155317 
27.7(21(531 

1.737417(5 
12.54111131 



Tftbli I. Number ind Percent of Conpreheniive Institutions Cliiiifiid 
Into Ten Cluiteri by Diicrlaiaant Analyiii. 



ClH«lfied froa 
Cluatar 



Claaalflcd Into Cluatar 
H I 



Total 





Number 


Percent 


Nuaber 


Percent 


Nuaber 


Percent 


Number 


Percent 


Nuaber 


Percent 


Nuaber 


Percent 


k 


0 


0.00 


0 


0.00 


0 


0.00 


0 


0.00 


1 


100.00 




100.00 


1 


0 


0.00 


0 


0.00 


0 


0.00 


0 


0.00 


1 


100.00 




100.00 


c 


179 


99.44 


a 


0.56 


0 


0.00 


0 


0.00 


0 


0.00 


110 


100.00 


D 


1 


100.00 


0 


0.00 
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0.00 


0 


0.00 


0 


0.00 




100.00 
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0.00 
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0.00 
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0.00 
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0.00 




100.00 
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0 
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10 
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Table 9. Results of the cluster analysl* for General 
Baccalaureate Institutions. 
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Tftbla 10. Number to* Percent of General lecceUureete Inetitutioiw 
ClmitU* Into T^lvt CliMtere by Olecrlaloaot AnalyeU. 
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Figure 4. Plot of BJack Student Enrol Inent by Tuition Dependence for 
General Baccalaureate Institutional 
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Figure 5. Plot of Rndlot/ment Level by Tuition Dependence for 
General Baccalaureate Institutions. « 
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